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Abstract 

The family of skew-symmetric distributions is a wide set of probability density func- 
tions obtained by combining in a suitable form a few components which are selectable 
quite freely provided some simple requirements are satisfied. Intense recent work has 
produced several results for specific sub-families of this construction, but much less is 
known in general terms. The present paper explores some questions within this frame- 
work, and provides conditions on the above-mentioned components to ensure that the 
final distribution enjoys specific properties. 
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1 Introduction and motivation 



1.1 Distributions generated by perturbation of symmetry 

In recent years, there has been quite an intense activity connected to a broad class of continu- 
ous probability distributions which are generated starting from a symmetric density functions 
and applying a suitable form of perturbation of the symmetry. The key representative of this 
formulation is the so-called skew-normal distribution, whose density function in the scalar 
case is given by 

f{x;a) = 2(l){x)^ax), (x G M), (1) 

where (f){x) and denote the N(0,1) density function and distribution function, respect- 
ively, and a is an arbitrary real parameter. When a = 0, (1) reduces to the N(0,1) distribu- 
tion; otherwise an asymmetric distribution is obtained, with skewness having the same sign 
of a. Properties of (1) studied by Azzalini (1985) and by other authors show a number of 
similarities with the normal distribution, and support the adoption of the name skew-normal. 

Furthermore, the same sort of mechanism leading from the normal density function to (1) 
has been applied to other symmetric distributions, including extensions to more elaborate 
forms of perturbation and constructions in the multivariate setting. Introductory accounts 
to this research area are provided by the book edited by Genton (2004) and by the review 
paper of Azzalini (2005), to which the reader is referred for a general overview. 

For the aims of the present paper, we shall largely rely on the following lemma, presented 
by Azzalini and Capitanio (2003). This is very similar to an analogous result developed 
independently by Wang et al. (2004); the precise interconnections between the two statements 
will be discussed in the course of the paper. Before stating the result, we recall that the 
notion of symmetric density function has a simple unique definition only in the univariate 
case, but in the multivariate case there exist different formulations; see Serfiing (2006) for 
an overview. In this paper, we adopt the notion of central symmetry, which in the case of a 
continuous distribution on M'^ requires that a density function p satisfies p{x — Xo) = p{xo — x) 
for all x G M*^, for some centre of symmetry xq. 

Lemma 1 Denote by fo a d- dimensional probability density function centrally symmetric 
about 0, by Go(-) a continuous distribution function on the real line such that go = G'q is an 
even density function, and byw an odd real-valued function onR'^ such that w{—x) = —w{x). 
Then 

fix) = 2 foix) Go{w{x)}, {x e W), (2) 

is a density function. 

This result provides a general mechanism for modifying an initial symmetric 'base' density 
/o via the perturbation factor G{x) = Go{w{x)} , whose components Go and w can be chosen 
among a wide set of options. Clearly, the prominent case (1) can be obtained by setting 
d = 1, fo = (j), Go = ^, w{x) = ax in (2). The term 'skew-symmetric' is often adopted for 
distributions of type (2). An important property associated to Lemma 1 is provided by the 
next statement. 
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Proposition 2 (Perturbation invariance) // the random variable Xq has density /o and 
X has density f, where /o and f satisfy the conditions required in Lemma 1, then the equality 

t{X) I tiXo) , (3) 

where — ' denotes equality in distribution, holds for any even q-dimensional function t on 
R'^, irrespectively of the factor G{x) = Gq{w{x)}. 

1.2 A wealth of open questions 

The intense research work devoted to distributions of type (2) has provided us with a wealth 
of important results. Many of these have however been established for specific subclasses 
of (2). The most intensively studied instance is given by the skew normal density which 
in the case d = \ takes the form (1). Important results have been obtained also for other 
subclasses, especially when /o is the Student's t density or the Subbotin density (also called 
exponential power distribution). 

Much less is known in general terms, in the sense that there still is a relatively limited 
set of results which allow us to establish in advance, on the basis of qualitative properties of 
the components fo,Go,w of (2), what will be the formal properties of the resulting density 
function /. Results of this kind do exist, and Proposition 2 is the most prominent example, 
since it is both completely general and of paramount importance in the associated distribu- 
tion theory; from this property, several results on quadratic forms and even order moments 
follow. Little is known about the distribution of non-even transformations. Among the 
limited results of the latter type, some general properties of odd moments of (2) have been 
presented by Umbach (2006, 2008). There are however many other questions, which arise 
quite naturally in connection with Lemma 1; the following is a non-exhaustive list. 

o In the case d = 1, which assumptions on G{x) ensure that the median of / is larger 
than 0? More generally, when can we say the the p-th quantilc of / is larger than the 
p-th. quantile of /o? Obviously, 'larger' here can be replaced by 'smaller'. 

o The even moments of / and those of /o coincide, because of (3). What can be said 
about the odd moments? For instance, is there an ordering of moments associated to 
some form of ordering of G{x)l 

o If /o is unimodal, which are the additional assumptions on Gq and w which ensure that 
/ is still unimodal? 

o When d > 1 , a related but distinct question is whether high density regions of the type 
Cu = {x : f{x) > u}, for an arbitrary positive u, are convex regions. 

The aim of the present paper is partly to tackle the above questions, but at the same time 
we take a broader view, attempting to make a step forward in understanding the general 
properties of the set of distributions (2). The latter target is the motivation for the prelim- 
inary results of Section 2, which lead to a characterization result in Section 2.2 and provide 
the basis for the subsequent sections which deal with more specific results. In Section 3 
we deal with the case d = 1 and tackle some of the questions listed above. Specifically, 
we obtain quite general results on stochastic ordering of skew-symmetric distributions with 
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common base /o , and these imply orderings of quantiles and of expected values of suitable 
transformations of the original variate. The final part of Section 3 concerns uniqueness of 
the mode of the density /. Section 4 deals with the case of general d, where various results 
are obtained. One of these is to establish convexity of the sets Cu for the more important 
subclass of the skew-elliptical family, provided the parent elliptical family enjoys the same 
property. We also examine the connection between the formulation of skew-elliptical dens- 
ities of type (2) and those of Branco and Dcy (2001), and prove the conjecture of Azzalini 
and Capitanio (2003) that the first formulation strictly includes the second one. Finally we 
gives conditions for the log-concavity of skew-elliptical distributions not generated by the 
conditioning mechanism of Branco and Dey (2001). 



2 Skew-symmetric densities with a common base 
2.1 Preliminciry facts 

Clearly, / in (2) depends on Go only via the perturbation function G{x) = Go{w{x)}. The 
assumptions on Go and w in Lemma 1 ensure that 

G{x)>0, G{x) + G{-x) = 1 , {xeR'^), (4) 

and it is conversely true that a function G satisfying these conditions ensures that 

f{x) = 2fo{x)G{x) (5) 

is a density function. In fact (4)-(5) represent the formulation adopted by Wang et al. (2004) 
for their result essentially equivalent to Lemma 1. 

Each of the two formulations has its own advantages. As remarked by Wang et al. (2004), 
the representation of G(x) in the form G(x) = Go{w{x)} is not unique. In fact, given one 
such representation, 

G{x) = G^iw^x)}, w^x) = G:^[Go{wix)}] 

is another one, for any strictly increasing distribution function G=f with even density function 
on M. 

On the other hand, finding a function G fulfilling conditions (4) is immediate if one 
builds it via the expression G(.x) = Go{u'(a;)}; in fact, this is the usual way adopted in the 
literature to select suitable G functions. Furthermore, Wang et al. (2004) have shown that 
the converse fact holds: any function G satisfying (4) can be written in the form Gq{w{x)}, 
and this can be done in infinitely many ways. A choice of this representation which we find 
'of minimal modification' is 

Go{t) = {t + l) /(-i,i)(2t) + /[i,+oo)(2t) , (t G M) , 
w{x) = G{x) -I, (xeW^), 

where Ia{x) denotes the indicator function of the set A. In plain words, this Go is the 
distribution function of a U{—^, g) variate. 

Another important finding of Wang et al. (2004, Proposition 3) is that any positive density 
function / on M'^ admits a representation of type (5), as indicated in their result which we 
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reproduce next with a little modification concerning the arbitrariness of G{x) outside the 
support of /q. Here and in the following, we denote hy —A the set formed by reversing the 
sign of all elements of A, if A denotes a subset of a Euclidean space. If ^ = —A, we say that 
A is a symmetric set. 

Proposition 3 Let f be a density function with support S C W^. Then a representation of 
type (5) holds, with 



foix) 



otherwise. 



[ arbitrary otherwise. 



(7) 



where Sq = {—S) U S, and the arbitrary branch of G satisfies (4)- Moreover /o is unique, 
and G is uniquely defined over Sq. 



Consider now a density function with representation of type (5). We first introduce a 
property of the cumulative distribution function F which is also of independent interest. 
Rewrite the first relation in (7) as 

f{-x) = 2fo{x)-f{x). (8) 

for any x = {xi,. . . , Xd)- If we denote by Fq the cumulative distribution function of /o, then 
integration of (8) on n^^i(— oo, Xj] gives 

F{-x) = 2Fo{x) ~ F{x) (9) 

where F denotes the survival function, that is F{x) = F{Xi > xi, . . . , > Xd}; (9) can be 
written as 

F{-x) + F{x) = Foix) + Fhi-x) 
and this is in turn equivalent to Proposition 2, as stated in Proposition 4 below. 



2.2 A characterization 



The five single statements composing the next proposition are known for the case d = 1, 
some of them also for general d. The more important novel fact is their equivalence, which 
therefore represents a characterization type of result. 

Proposition 4 Consider a random variable X = iXi, ...X^)^ with density function f and 
cumulative distribution function F, and a continuous random variable Y = {Vi, ...Yd)~^ with 
density function h and distribution function H. Then the following conditions are equivalent: 

(a) the densities fix) and admit a representation of type (5) with the same symmetric 
base density foix), 

(b) tiX) = tiY), for any even q- dimensional function t on W^, 

(c) Fix e A) = P{Y e A), for any symmetric set A C M^, 
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(d) F{x) + F{-x) = H{x) + H{-x), 

(e) fix) + fi-x) = h{x) + h{-x), (a.e.). 

Proof 

(a) =^(b) This follows from the perturbation invariance property of Proposition 2. 

(b) ^(c) Simply notice that the indicator function of a symmetric set A is an even function. 

(c) =>(d) On setting 

= {s = (si, ...,Sd) : Sj < Xj,yj}, 
A_ = {s = (si, ...,Sd) eR'^ : -Sj < Xj^j} = -A+, 
Au = A+UA_, 
An = A+nA_, 

both are symmetric sets; hence we get 

F{x)+F{-x) = P{X e A+) + P{X e A_) , 
= P{x e Au) + Fix e An) . 

(d) =>(e) Taking the d-th mixed derivative of (d), relationship (e) follows. 

(e) =»(a) It follows from the representation given in Proposition 3. 

In the special case d = 1, the above statements can be re- written in more directly 
interpretable expressions. Specifically, (9) leads to 

1 - F{-x) = 2 Fo{x) - F{x) , (10) 

which will turn out to be useful later, and 

F{x) - F{-x) = Fo{x) - Fo{-x). 

Moreover, when d = 1, conditions (c) and (d) in Proposition 4 can be replaced by the 
following more directly interpretable forms: 

(c') \x\ = |y|, 

(d') F{x) - F{-x) = H{x) - H{-x) , 

the first of which has appeared in Azzalini (1986), and the second one is an immediate 
consequence. 
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3 Some results when d = 1 



3.1 Stochastic ordering the univetriate case 

In this section, we focus on the case with d = 1. We first introduce an ordering on the set 

of functions which satisfy (4). When this concept is restricted to symmetric distribution 
functions, it reduces to the pcakcdness order introduced by Birnbaum (1948), to compare 
the variability of distributions about 0. 

Definition 5 If Gi and G2 satisfy (4), we say that G2 is greater than Gi on the right, 
denoted G2 >grGi, if G2{x) > Gi{x) for all x > and strict inequality holds for some x. 

Of course it is equivalent to require that G2{x) < Gi{x) for all x < and the inequality 
holds at some x. Another equivalent condition is that 

G2{s) - G2{r) > Giis) - Gi(r), (r < < s) . 

If we now consider a fixed symmetric 'base' density /o and the perturbed distribution 
functions associated to Gi and G2, that is 

Fk{x) = r 2 fo{u) Gk{u) du, {k = 1, 2), (11) 

J —00 

the ordering G2 >gr Gi implies immediately the stochastic ordering of Fi and F2 in the 
usual sense that F2 is stochastically larger than Fi if Fi(s) > F2{s) for all s. To see this, 
consider first s < 0; then Gi{x) > G2{x) for all x < s, and this clearly implies Fi{s) > F2{s). 
If s > 0, the same conclusion holds by using (10) with x = —s. We have then reached the 
following conclusion. 

Proposition 6 If Gi and G2 satisfy condition (4), and G2 >gr Gi, then the distribution 
functions (11) satisfy 

Fi{x)>F2{x), (xeR). (12) 

Since Go is a monotonically increasing function, then it can be easier to check the ordering 
of Gi and G2 via the ordering of the corresponding u;(x)'s. 

Proposition 7 If Gi = Go{wi{x)) and G2 = Gq{w2{x)) where Gq is as in Lemma 1, and 
wi and W2 are odd functions such that W2{x) > wi^x) for all x > 0, then G2 >gr Gi and 
(12) holds. 

Figure 1 illustrates the order G2 >gr Gi and the stochastic order between the correspond- 
ing distributions functions Fi{x) > F2{x), as stated by Proposition 6. Here /o is the Cauchy 
density, Gq is the Cauchy distribution functions, and two forms of w{x) are considered, 
namely wi{x) = x^ — x, W2{x) = x^. The two perturbation functions Gi{x) = Go{wi{x)) 
and G2{x) = Go{w2{x)) are plotted in the left panel; the right panel displays the corres- 
ponding distribution functions Fi{x) and F2{x). 

The stochastic ordering of the F^s translates immediately into a set of implications about 
ordering of moments and quantiles of the F^s. Specifically, if is a random variable with 
distribution function F^, for k = 1,2, then the following statements hold. 
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Figure 1: Cauchy density function fo{x) perturbed by Go equal to the Cauchy distribution, 
choosing wi{x) = — x, and W2{x) = x^; on the left panel Gi (continuous line) and G2 
(dashed line), on the right Fi (continuous) and F2 (dashed) 

o If Qk{p) denotes p-th quantile of Xk for any < p < 1, then 

Qi{p)<Q2ip), 

and there exists at least one p for which the inequality is strict. 
o For any non-decreasing function t such that the expectations exist, 

E{t{Xi)} < E{t{X2)} (13) 
and the inequality is strict if t is increasing. 

A further specialized case occurs when t{x) = a;^"~^ in (13), for n = 1,2,..., which 
corresponds to the set of odd moments. In this case, (13) improves a result of Umbach 
(2006) stating that 

E{X^'''-^} < E{XI''-^} < E{X^'"-^} 

where Xq has density /o and X^ has density 2/o on the positive axis, which corresponds to 
G{x) = /[o^oo)(2^) (5) the density of Xi corresponds to a Gi which is a distribution 
function. 

It can be noticed that, if G2 >gh Gi >gr Gj^ = |, then the variances of the corresponding 
variables Xk decrease with respect to >gr, that is var{X2} < var{Xi} < var{Xo}, while 
the reverse holds if G+ >gr Gi >gr G2- 

A simple but popular setting where Proposition 6 applies is when w{x) = ax, for some 
real a, leading to the following immediate implication. 

Proposition 8 // /o and Gq are as in Lemma 1, then the set of densities 

f{x;a) = 2fo{x)Go{ax) (14) 
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indexed by the real parameter a are associated to distribution functions which are stochastic- 
ally ordered with a. 

Notice that, when a in (14) is positive, it has a direct interpretation as an inverse scale 
parameter for Go, while it acts as a shape parameter for f{x). Another case of interest is 
given by 

w[x) = ax\ K , 

S u + X'^ 

which occurs in connection with the skew Student's t distribution with v degrees of freedom, 
studied by Azzalini and Capitanio (2003) and others, where /o and Go are of Student's t type 
with V and v + 1 degrees of freedom, respectively. Because of Proposition 7, the distribution 
functions associated to (2) with this choice of w{x) are stochastically ordered with respect 
to a, whether or not /o and Go correspond to a Student's t distribution. 

3.2 On the uniqueness of the mode 

To examine the problem of the uniqueness of the mode of / when d = 1, it is equivalent and 
more convenient to study log/. If /o(a;) and g{x) = G'{x) exist, then 

Kx) = ^log/(x) 

foix) gjx) 

fo{x) ^ G{x) 
= -ho{x) + hg{x) , 

say. The modes of / are a subset of the solutions of the equation 

ho{x) = hg{x) , (15) 

or they are on the extremes of the support, if it is bounded. Since at least one mode always 
exists, we look for conditions to rule out the existence of additional modes. 

For the rest of this subsection, we assume that G(.t) is a monotone function satisfying 
(4) . Without loss of generality, we deal with the case that G is monotonically increasing; for 
decreasing functions, dual conclusions hold. 

In most common cases, /o is unimodal at 0, hence non-decreasing for x < 0. Therefore 
the product /o(a;) G{x) is increasing, and no negative mode can exist. The same conclusion 
holds if /o is increasing and G{x) is non-decreasing for x < 0. 

To ensure that there is at most one positive mode, some additional conditions are required. 
For simplicity of argument, we assume that /o and G have continuous derivative everywhere 
on the support ^o of /o; this means that we are concerned with uniqueness of the solution 
of (15). A sufEcient set of conditions for this uniqueness is that ho{x) is increasing and 
g{x) is decreasing. These requirements imply that /ig(0) > and kg is decreasing, so that 
= /io(0) < hg{Q) and the two functions can cross at most once for a; > 0. When Sq is 
unbounded, a solution of (15) always exists, since ^ and /i^ — as a; — > oo. If Sq 
is bounded, (15) may happen to have no solution; in this case, f{x) is increasing for all x 
and its mode occurs at the supremum of ^o. We summarize this discussion in the following 
statement. 



9 



Tabic 1: Some commonly used densities /o and associated components 



distribution foix) ho{x) ^^{x) 

standard normal ^(a;) x 1 

— 1 2 

logistic 



(1 + e^)2 + 1 (e^ + 1)2 

Subbotin Cj/ exp j sgn(a;) Ixl'^"^ sgn(a;) (z^ — l)|x|''~2 



Student's t^, ( 1 + 



x'^\ + 1 X {u + - x^) 



v J f l+x'^/v {v + x"^)"^ 



Proposition 9 If G{x) in (5) is a increasing function and fo{x) is unimodal at 0, then no 
negative mode exists. If we assume that fo and G have continuous derivative everywhere 
on the support fo, G{x) is concave for x > 0, and fo{x) is log-concave, where at least one 
of these properties holds in a strict sense, then there is a unique positive mode of f{x). If 
G{x) is decreasing, similar statements hold with reversed sign of the mode; uniqueness of the 
negative mode requires that G{x) is convex for x < 0. 

Recall that the property of log-concavity of a univariate density function is equivalent to 
strong unimodality; see for instance Section 1.4 of Dharmadhikari and Joag-dev (1988). 

To check the above conditions in specific instances, it is convenient to work with the 
functions ho and g' , if the latter exists. In the case of increasing w{x), uniqueness of the 
mode is ensured if g'{x) < for a; > and ho{x) is an increasing positive function. In the 
linear case w{x) = ax, log-concavity of fo{x) and unimodality of goix) at suffice to ensure 
unimodality of f{x). 

Tabic 1 recalls some of the more commonly employed density functions fo and their 
associated functions ho and h'g. For the first two distributions of Table 1, and for the 
Subbotin's distribution when u > 1, ho is increasing. If one combines one of these three 
choices of fo with the distribution function of a symmetric density having unique mode at 0, 
then uniqueness of the mode of f{x) follows. Clearly, the condition of unimodality of g{x) 
holds if go is unimodal at and w{x) = ax. The criterion of Proposition 9 does not apply for 
the Student's distribution, since ho{x) is increasing only in the interval (— y^, -y/z7). Hence 
a second intersection with kg cannot be ruled out even if g{x) is decreasing for all x > 0. 
However, for the skew-t distribution, unimodality has been established in the multivariate 
case by Capitanio (2008) and Jamalizadeh and Balakrishnan (2010), and furthermore it 
follows as a corollary of a stronger result to be presented in Section 4. 

The requirement of differentiability of fo and G in Proposition 9 rules out a limited 
number of practically relevant cases. For this reason, we did not dwell on a specific discussion 
of less regular cases. One of the very few relevant distributions which arc excluded occurs 
when fo is the Laplace density function. This case is however included in the discussion of 
the multivariate Subbotin distribution, developed in Section 4.3, when v = 1 and d = 1. 

Although Proposition 9 only gives a set of sufficient conditions for unimodality, the 
condition that g{x) is decreasing for a; > cannot be avoided completely. In other words, 
when / is represented in the form (2), the sole condition of increasing w{x) is not sufficient for 
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nu=lnf. alpha=(Q.1). skewinq=norm 




Figure 2: Case with fo = cj), Gq = ^ , ■w{x) = x^. The left-side panel displays the functions 
ho (continuous), g (dashed), hg (dot-dashed); the right-side panel displays f{x) 

unimodality. This fact is demonstrated by the simple case with fo = (j), Gq = ^, w{x) = x^ , 
whose key features are illustrated in Figure 2. Since tt;'(0) = 0, then g{0) = 0; hence (15) has 
a solution in 0, but the left panel of Figure 2 shows that there are two more intersections of 
ho and hg for a; > 0, one corresponding to an anti-mode and one to a second mode of f{x), 
as visible from the right panel of the figure. 

This case falls under the setting examined by Ma and Genton (2004) who have shown 
that for fo = (j), Go = 'w{x) = ax + I3x^ there are at most two modes. Some additional 
conditions may ensure unimodality: one such set of conditions is a, /3 > and > 6/3. To 
prove that they imply unimodality of /, consider 

d'^log^{w{x)) _ (j){w{x)) 
dx2 ^{wix))"^ 

X {$(u'(x))[(/3x^ + ax)(3/3x2 + af -6f3x] + 0(u;(x))(3/3x2 + af} . 

whose terms inside curly brackets, except —6/3 x, are all positive for x > 0. Since > 6/3, 
then (a^ — 6/3)x is positive, so that this derivative is negative and Go{w{x)) is log-concave 
for a; > 0. For x < 0, we use this other argument: since Go is increasing and log-concave and 
w{x) is concave in the subset x < 0, then the composition Go{w{x)) is log-concave in the 
subset X < 0; see Proposition 10 (iii) below. Since the above second derivative is continuous 
everywhere, then Go{w{x)) is log-concave everywhere. 

4 Quasi-concave and unimodal densities in d dimensions 

A real- valued function / defined on a subset S of M'^ is said to be quasi-concave if the sets 
of the form Cu = {x : f{x) > u} are convex for all positive u. If d = 1, the notion of 
quasi-concavity coincides with uniqueness of the maximum, provided a pole is regarded as 
a maximum point, but for d > 1 the two concepts separate. This motivates the following 
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digression about concavity and related concepts, to develop some tools which will be used 
later on for our main target. 

4.1 Concavity, quasi- concavity and unimodality 

We first recall some standard notions available for instance in Chapter 16 of Marshall and 
Olkin (1979). A real function / defined on a convex subset S of is said to be concave if, 
for every x and y & S and 9 € (0, 1), we have 

f{ex + {i-9)y)>ef{x) + ii-e)f{yy, 

in this case — / is a convex function. A function / is said to be log-concave if log / is concave, 
that is for every x and y E S and G (0, 1) we have 

f{ex + {l-e)y)>f{xff{yf-'. 

The terms strictly concave and strictly log-concave apply if the above inequalities hold in a 
strict sense for all x ^ y and all 6. 

Concave and log-concave functions defined on an open set are continuous. Moreover a 
twice differentiable function is concave (strictly concave) if and only if its Hessian matrix is 
negative semi-definite (negative definite) everywhere on S. 

The next proposition provides the concave and log-concave extension of classical compos- 
ition properties for convex functions such as statement (i) which can be found for example in 
Marshall and Olkin (1979, p. 451) together with its proof; the proofs of the other statements 
are completely analogous. 

Proposition 10 Let h be a real function defined on a convex set S, a subset of W^, and H 
a monotone real function defined on a convex subset of M, such that the composition H{h) 
is defined on S. Then the following properties hold. 

(i) If h is convex and H non- decreasing and convex, then H{h) is convex. Moreover H{h) 
is strictly convex if H is strictly convex, or if h is strictly convex and H is strictly 
monotone. 

(a) Ifh is convex andH non-increasing and log-concave, then H(h) is log-concave. Moreover 
H{h) is strictly log-concave if H is strictly log-concave, or if h is strictly convex and 
H is strictly monotone. The same statements hold replacing the term log-concave by 
concave throughout. 

(Hi) If h is concave and H non- decreasing and log-concave, then 11(h) is log-concave. 
Moreover H{h) is strictly log-concave if H is strictly log-concave, or if h is strictly 
concave and H is strictly monotone. The same statements hold replacing the term 
log- concave by concave throughout. 

We have defined quasi-concavity by requiring convexity of all sets C^. An equivalent 
condition is that, for every x and y G 5 C M"' and 6 G (0,1), we have 

/(0x+(l-%)>min{/(x),/(y)}. 
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Obviously a function which is concave or log-concave is also quasi-concave. Similarly, both 
strict concavity and strict log-concavity imply strict quasi-concavity. 

We now apply the above notions to the case where / represents a probability density 
function on a set S C M*^. The concept of unimodality has a friendly formal definition in 
the univariate case, see for instance Dharmadhikari and Joag-dev (1988, p. 2), but this has 
has no direct equivalent in the multivariate case. Informally, we say that the term mode 
of a density refers to a point where the density takes a maximum value, either globally or 
locally. While a boring formal definition which allows for the non-uniqueness of the density 
function could be given, such a definition is not really necessary for the main aims of the 
present paper, since the density functions which we are concerned with are so regular that 
their modes are either points of (local) maxima or poles. 

The set of the modes of a quasi-concave density is a convex set. Moreover, if / is strictly 
quasi-concave, then the mode is unique. When the mode is unique we say that density / is 
unimodal, and we say that / is c-unimodal if the set of its modes is a convex set. If X is a 
random variable with density function / which is unimodal, we shall say that X is unimodal, 
with slight abuse of terminology. The same convention is adopted for log-concavity, quasi- 
concavity and other properties. 

Another important notion is s-concavity, which helps to make the concept of quasi- 
concavity more tractable. A systematic discussion of s-concavity has been given by Dhar- 
madhikari and Joag-dev (1988); see specifically their Section 3.3, of which we now recall the 
main ingredients. Given a real number s 7^ 0, a density is said to be s-concave on S if 

f{ex + (1 - e)y) > {efixy + (i - e)f{yry/^. 

for all X, y e S and all 9 G (0, 1). 

Clearly, concavity corresponds to s = 1. A density / is s-concave with s < if and only 
if is convex; similarly, a density / is s-concave with s > if and only if is concave. If 
we call (— oo)-concave a function which is quasi-concave and 0-concave a function which is 
log-concave, then the class of sets of s-concave functions is increasing when s decreases; in 
other words, if / is s-concave, then it is r-concave for any r < s. Finally, notice that is easy 
to adapt Proposition 10 to s-concave functions. 

The closure with respect to marginalization of s-concave densities depends on the value of 
s and on the dimensions of the spaces, as indicated by the next proposition, which essentially 
is Theorem 3.21 of Dharmadhikari and Joag-dev (1988). 

Proposition 11 Let f be an s-concave density on a convex set S in M'*+"^, and fa be the 

marginal density of f on an d- dimensional subspace. If s > —1/m, then fd is Sm-concave 
on the projection of the support of f, where Sm = s/{l + ms), with the convention that, if 
s = —1/m, then Sm = —00. 

Notice that this result includes the fact that the class of log-concave densities is closed with 

respect to marginalization. In addition, from a perusal of the proof of the above-quoted 
Theorem 3.21, we obtain that the marginal densities are strictly s^-concave provided / is 
strictly s-concave or the set S is strictly convex. 
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4.2 Skew-elliptical distributions generated by conditioning 



A d-dimensional random variable U is said to have an elhptical density, with density generator 
function /, if its density fu is of the form 

fu{y) = k~f{y'n-^y), (16) 

where O is a d-dimensional positive definite matrix, the function / : (0, +oo) M"'' is such 
that x'^/'^~^f{x) has finite integral on (0, +oo) and A; is a suitable constant which depends 
on d and det(r2). In this case, we shall use the notation U ~ fd(0, 0, /). 

Note that an elliptical density / is c-unimodal if and only if its density generator is non- 
increasing, and it is unimodal if and only if its density generator is decreasing. Then it turn 
out that / is c-unimodal if and only if it is quasi- concave, and it is unimodal if and only if 
it is strictly quasi-concave. 

An initial formulation of skew-elliptical distribution has been considered by Azzalini and 
Capitanio (1999), which was of type (2) with /o of elliptical class and w{x) linear. Another 
formulation of skew-elliptical distribution has been put forward by Bianco and Dey (2001), 
whose key ingredients are now recalled. Consider a, {d + l)-dimensional random variable 

U={^^^^£d+M^+,}), where f^+=(j 'll)^^' ^^^^ 

and Uq and Ui have dimension 1 and d, respectively; for our aims, there is no loss of 
generality in assuming that the diagonal elements of 0+ are all I's. Then a random variable 
Z = {Ui\Uq > 0) is said to have a skew-elliptical distribution, and its density function at 
ui G is 

f + OO 

fz{ui) = 2 kJ{u^n-\)duo (18) 
Jo 

where = (uq, uJ). This construction arises as an extension of one of the mechanisms for 
generating the skew-normal distribution to the case of elliptical densities, but the study of 
the connections with other densities of type (2) was not an aim of Branco and Dey (2001). 

Consequently, one question investigated by Azzalini and Capitanio (2003) was whether 
all distributions of type (18) are of type (2), with the requirement that /q is the density of an 
elliptical d-dimensional distribution. The conjecture has been proved for a set of important 
cases, notably the multivariate skew-normal and the skew-t distributions, among others, but 
a general statement could not be obtained. This general conclusion is however quite simple 
to reach using representation (5), and recalling that Branco and Dey (2001) have proved 
that (18) can be written as 

fz{y) = 2 fo{y)Fy{a'^y), (y G R"), (19) 

where /o is the density of an elliptical d-dimensional distribution, and Fy is a cumulative 
distribution function of a symmetric univariate distribution, which depends on y only through 
y^O^^^y. Since Fy = F^y, then it is immediate that G{y) = Fy{a^ y) satisfies (4). Hence 
(19) allows a representation of type (5), and via (6) also of type (2). 

Proposition 12 Assume that the random variable U in (17) is c-unimodal. If f is log- 
concave, then the elliptical densities of U and U\ and the skew- elliptical density of Z are 
log-concave. Moreover they are strictly log-concave if U is unimodal or f is strictly log- 
concave or the support of f is bounded. 
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Proof. Function h{u) = vJQ,'^ u is strictly convex. Since U is c-unimodal, then / is 
non-increasing, moreover it is log-concave; therefore f{vJQ.'^u) is log-concave by Proposi- 
tion 10 (ii). Then both U and {U\Uq > 0) have log-concave densities. Since the marginals of 
a log-concave density are log-concave, then log-concavity of Ui and Z holds by (18). Now, if 
U is unimodal, / is decreasing, and f{uJQ,'^^u) is strictly log-concave, by Proposition 10 (i). 
If / is strictly log-concave, then f{vJfl^^u) is strictly log-concave. Finally, if the support 
of / is bounded, then the support of U is strictly convex and, by Proposition 10 (i), also in 
this case f{uJQ,^^u) is strictly log-concave. Then, in all three cases, strict log-concavity of 
Ui and Z holds by recalling the remark following Proposition 11. 

This proposition is a special case of the more general result which follows, but we keep 
Proposition 12 separate both because of the special role of log-concavity and because this 
arrangement allows a more compact exposition of the combined discussion. 

Proposition 13 Assume that the random variable U in (17) is c-unimodal. If f is s- 
concave, with s > —1, then U has s-concave density, whereas the elliptical density ofUi and 
the skew- elliptical density of Z are si-concave, with Si = s/{l-\-s). Moreover all conclusions 
hold strictly if U is unimodal or f is strictly s-concave or the support of f is bounded. 

Proof. The function h(u) = vJO.'^u is strictly convex. Since U is c-unimodal, then / 
is non-increasing and moreover it is s-concave. We now examine properties of concavity 

separating the case s < and s > 0; the case s = 0, which corresponds to log-concavity, has 
already been handled in Proposition 12. If s < then f^ is non-decreasing and convex. Then 
f'ivJ VL~^^u) = {/(u^O^^w)}* is convex by Proposition 10 (i) and f{u^VL^^u) is s-concave. 
On the other hand, if s > then is non- increasing and concave. Then f^{vJ^'^u) = 
{f{vijQ.'^u)"\^ is concave by Proposition 10 (ii) and f{u^Qr^u) is s-concave. Then both U 
and {U\Uq > 0) have s-concave densities. Now, the claim about the densities of Ui and Z 
follows from Proposition 11 by taking into account (18). The final statement follows by the 
same type of argument used in the proof of Proposition 12. 

Note that, in the special case of a concave density generator, the support is bounded, and 
both the marginal density on and the skew-symmetric density of Z are not necessarily 
concave. However, using Proposition 13 with s = 1, strict 1/2-concavity of their densities 
follows, and this fact implies strictly log-concavity. 

The results of Proposition 12 and Proposition 13 allow to handle several classes of dis- 
tributions, of which we now sketch the more noteworthy cases. 

A important specific instance is the multivariate skew-normal density which can be rep- 
resented by a conditioning method. For an expression of the multivariate skew-normal den- 
sity, see for instance (16) of Azzalini (2005). Since the density generator of the normal 
family, f{x) = exp(— x/2), is decreasing and log-concave, then from Proposition 12 we ob- 
tain log-concavity of the skew-normal family. This conclusion is however a special case of a 
more general result on log-concavity of the SUN distribution obtained by Jamalizadeh and 
Balakrishnan (2010); see their Theorem 1. 

The {d -\- l)-dimensional Pearson type II distributions for which f{x) = (1 — xY , where 
X € (0, 1) and > 0, satisfies the conditions of Proposition 13. In fact it is non-increasing 
and -concave on a bounded support. Then the skew-elliptical d-dimensional density is 
strictly {v + l)~^-concave and therefore strictly log-concave. The density function of the 
skew- type II density function is given by (22) of Azzalini and Capitanio (2003). 
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In addition, Proposition 13 holds for the Pearson type VII distributions, and in particular 
for the Student's distribution. In this case the density generator is given by 

f{x) = (1 + x/i^)-^ {v < 0, (d + l)/2 < M). (20) 

and M = {d + i/+ l)/2 for the Student's density. Such generator is decreasing and s-concave 
with s = — 1/M; in fact f{x)~^/^ is convex. Since s > —1, then Proposition 13 applies and 
the skew-t is si-concave with si = — 1/(M — 1), and si = — 2/(d + v — 1) in the Student's 
case. These densities are not log-concave, but they are still strictly quasi-concave. Hence 
unimodality follows. For expressions of the multivariate skew-type VII and skew-t density, 
see (21) and (26) of Azzalini and Capitanio (2003), respectively. 

The above results establish not only unimodality of the more appealing subset of the 
skcw-clliptical family of distributions, namely those of type (18), but also the much stronger 
conclusion of quasi-concavity of these densities. It is intrinsic to the nature of skew-elliptical 
densities that they do not have highest density regions of elliptical shape, but it is reassuring 
that they maintain a qualitatively similar behaviour, in the sense that convexity of these 
regions, Cu in our notation, holds as long as the parent {d+ l)-dimensional elliptical density 
enjoys a qualitatively similar property but in a somewhat stronger variant, specifically s- 
concavity with s > — 1. 

Note that there is no hope to extend Proposition 12 to quasi-concave densities, in the sense 
that a skew-symmetric generated by conditioning a quasi-concave density is not necessarily 
quasi-concave as demonstrated by the following construction. 



Example Consider U = {Uo,Ui)~^ ^ £"2(0,^^+,/), where 

/ = ^(0,1) + ^(0,4^) and 0+ = ^ ^^^^ , 
whose density function is 

fu{x,y) = k{Isi{x,y) + Isi{x,y)} 

where Sj = {(.t, y) € : + — < 3j^/4}, j = 1, 4, and k is the normalizing constant 
given by A; = l/{Ai + A4) « 0.0216 where Aj = . Then both Uq, and Ui have 

common support [—4, 4] and density function 

fuoix) = fuA^) = k [^/3{l-x^)I^_,,^){x) + y/3{16-y^)) . 

Because of (16) and (18), the density of Z = {Ui\Uo > 0) is given by 

fz{y) = k{h{y) + U{y)} 

where 



r+00 
^0 



' y + V^if - y^) if -V3j/2 <y< V3j/2, 



fj {y) = ^ I Is, (x, y)dx=l 2^3{f - y2) if V3j/2 < y < J , 

, otherwise, 

for J = 1,4, and it is displayed in Figure 3. The global maximum of fz is where k{2y + 
V^3(T6 - y^) + y^3(l — x^)) takes its maximum value, that is at y 0.699. When y > 1, 
fz = kf4 and there is another local maximum at y = 2. Therefore, fz is not unimodal. 

To conclude with, while the density of U is quasi-concave, the skew-elliptical variable Z 
generated by conditioning is not quasi-concave. 
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Figure 3: Density function fzi^), exhibiting lack of quasi-concavity, obtained by conditioning 
of a bivariate elliptical quasi-concave distribution 



4.3 Log-concavity of other families of distributions 

There are several other families of distributions which belong to the area of interest of the 
stream of literature described at the beginning of this paper but are not included in the 
conditioning mechanism of an elliptical distribution considered in § 4.2. This section deals 
with log-concavity of some of these other families, making use of the following immediate 
implication of Proposition 10. 

Corollary 14 If qo is a log-concave function defined on a convex set S Q M'^, and H and h 
are as in Proposition 10, either (ii) or (Hi), then 



is log-concave on S. 

Example The density function on the real line introduced by Subbotin (1923) has been 
variously denoted by subsequent authors as exponential power distribution, generalized error 
distribution and normal distribution of order v. Its multivariate version is 



where C is a symmetric positive definite matrix, is a positive parameter and Cu a normaliz- 
ation constant. For = 2 and = 1, fu lends the multivariate normal and the multivariate 
Laplace density, respectively. 





(21) 




We first want to show that f,y is log-concave if v > 1. Consider h{x) = (x'^Cx)^^'^ whose 
Hessian matrix is 




h{x) 



-3 




dx dx 
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say. To show that this Hessian is positive semi-definite, it is sufficient to prove this fact for 
matrix M, since h{x) > 0. For any u G W^, write 

u^Mu = {x'^Cx){u^Cu) - {u^Cx){x'^Cu) = \\uf \\xf - {u^xf 

where u = C^^^u and x = C^l'^x for any square root C^/^ of C, and from the Cauchy-Schwarz 
inequahty we conclude that vJMu > 0. Then h is convex. Next, write 

— log fi,{x) = constant -|- h{xy /v 

and observe that, since is a strictly convex for t > 0, then — log fu is convex for i/ > 1 
and strictly convex for z/ > 1 by Proposition 10 (i). Hence is log-concave for v > 1 and 
strictly log-concave for u > 1. 

Now wc introduce a skewed version of /,y of type (2). If we aim at obtaining a density 
which fulfils the requirements of both Lemma 1 and Corollary 14, then H = Gq is non- 
decreasing, while function h = w must be odd and concave, hence it has to be linear. We 
then focus on the density function 

f{x) = 2U{x)Go{a^x), (xGM'^), (22) 

where Go is a distribution function on M, symmetric about 0. 

Among the many options for Gq, a quite natural choice is to take Gq equal to the 
distribution function of fi/ in the scalar case, that is 

g.w4(i+sgn(t) ^<|f^/;;-;/-'' ). ,.m, 

where 7 denotes the lower incomplete gamma function. This choice of Gq has been examined 
by Azzalini (1986) in the case ci = 1 of (22). He has shown that Gq is strictly log-concave 
if u > 1, leading to log-concavity of (22) when d = 1. The case u = 1 which corresponds 
to the Laplace distribution function is easily handled by direct computation of the second 
derivative to show strict log-concavity of Gq- Now, combining strict log-concavity of Gq with 
log-concavity of fu proved above, an application of Corollary 14 shows that (22) is strictly 
log-concave on M*^ if > 1. 

Although (22) is of skew-elliptical type, it is not of the type generated by the conditioning 
mechanism of a (d -|- l)-dimensional elliptical variate considered in Section 4.2. In fact, the 
results of Kano (1994) show that the set of densities fu is not closed under marginalization, 
and this fact affects the conditioning mechanism (18) as well. 

As an example of non-elliptical distribution, we can consider a d-fold product of univariate 
Subbotin's densities, that is 

d 

and this density can be used as a replacement of fi, in (22). Since each factor of this product 
is log-concave, ii v > 1, the same property holds for /*. Strict log-concavity holds for 
2/*(x)Go(a^x) as well, using again strict log-concavity of Gq. 
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Example To illustrate the applicability of Corollary 14 to distributions outside the set of 
type (2), consider the so-called extended skew-normal density which in the c/-dimensional 
case takes the form 

f{x) = U^;n) ^(«o + «^^) ^ {xeR% (23) 

where r G M and ao = T{a'^Qa)^/'^ . Although this distribution does not quite fall under the 
umbrella of Lemma 1 unless r = 0, its constructive argument is closely related. 

To show log-concavity of (23), first recall the well-known fact that (j)(i{x;^) is strictly 
log-concave. Moreover $ is log-concave, as if follows by direct calculation of the second 
derivative of log$, taking into account the well-known fact —y^{y) < 4>{y) for every y < 0. 
In addition, since $ is strictly increasing and ao + x is concave in a non-strict sense, 
Corollary 14 applies to conclude that (23) is strictly log-concave. 

Although this conclusion is a special case of the result of Jamalizadeh and Balakrishnan 
(2010) concerning log-concavity of the SUN distribution, it has however been presented 
because the above argument is different. 
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